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1.  INTRODUCTION 


7. - - 


r  s  -  r  b  •  s 


t.  - 


The  following  problenris  of  practical  and  theoretical  importance  in 
reliability. 

Under  study  is  a  coherent  structure  consisting  of  n  independent  com¬ 
ponents.  The  structure  is  observed  continuously  until  it  fails.  For  com¬ 
ponents  failing  before  or  at  system  failure  time,  we  observe  complete  life 
lengths;  for  components  still  functioning  at  system  failure  time  we  observ 
censored  lifc'lengths .  For  the  system  we  observe  system  lifelength.  From 
a  sample  of  m  such  structures  we  wish  to  estimate  the  system  lifelength 
distribution. 

As  far  as  we  know,  tnis  problem  has  not  been  solved  or  even  treated 
in  the  literature. 

In  this  paper  we  propose  an  estimator  of  the  system  lifelength  distri 
bution.  As  a  by-product,  we  also  obtain  estimates  of  the  mean  and  quan¬ 
tiles  of  the  system  lifelength  and  of  all  the  corresponding  quantities  for 
component  lifelengths.  In  Section  2  we  give  several  asymptotic  results 
concerning  the  estimators.  In  Section  3  we  present  and  discuss  two  boot¬ 
strap  schemes  that  are  used  in  small  samples  to  assess  the  variability  of 


our  estimates  and  to  construct  confidence  intervals. 


*  f&JU 


NOTATION 


We  assume  that  the  system  and  each  component  are  in  either  a  function 

ing  state  or  a  failed  state.  Thus,  we  can  define  the  lifelengths  of  the 

components  and  of  the  system.  Let 

=  the  lifelength  of  component  j  in  system  i;  i  . 

J 

T^  =  the  lifelength  of  system  i;  | 


6.  .  =  I  (X.  .  <  T. ) ; 

1J  1J 

be  the  distribution  function  of  X„,  and  the  component  reliability  func¬ 
tion  be  S.(t)  =  l-  F.(t); 

J  3 

Fs  be  the  distribution  function  of  T\  ,  and  the  system  reliability  function 
be  Ss(t)  =  1  -  Fg(t); 

H.  be  the  distribution  function  of  Z...  and  H.(t)  denote  l-ll.(t). 

)  ij  J  3 

In  the  definitions  above  the  letter  i  indexes  systems  and  the  letter  j 
indexes  components.  Throughout  the  paper  i  ranges  from  1  to  m,  and  j  from 


1  to  n.  The  random  variables  X. .  are  not  observed.  We  observe  onlv  the 

ij 

Z .  . '  s  and  6 .  .  ' s . 

13  13 

It  is  helpful  to  keep  in  mind  a  concrete  example.  Figure  1  below 
shows  diagrammaticallv  a  simple  structure  of  3  components,  arranged  neither 
in  series  nor  in  parallel.  We  carrv  this  example  throughout  the  paper.  In 
the  example  (the  subscript  i  indexing  systems  has  been  suppressed)  T  = 


*1  A  ^2  V  ’  w^ere  x  Ay  =  min(x,  y)  and  xvy  =  max(x,  y) 


Figure  1  X^ =  lifelength  of  component  i,  i  =  l,  2,  3. 


We  note  that  Fg  can  be  estimated  naively  by  the  empirical  distribution 


function  of  the  T. 's: 

i 


Femp(t)=-  I  I(T.st)  for  t  SO. 
s  m  .  L  ,  i 


However,  it  is  clear  that  this  estimator  does  not  use  all  the  information 


available  in  the  sample. 


THE  ESTIMATOR 


For  any  coherent  structure  of  n  independent  components  there  corre¬ 
sponds  a  function  h:  [0,  l]n -*•[(),  1],  such  that 

12)  Ss(t)  =h(S1(t),  ....  Sn(t))  for  tiO. 

See  Chapter  2  of  Barlow  and  Proschan  (1975)  for  details.  In  the  example 
given  by  Figure  1,  we  have 

Ss(t)  «Sj(t)[l-  (1  -  S2(t))(l  -  S3(t))l , 

so  that 


hfUj,  u2>  u  )  =  u  [1  -  (1  -  u2) (1  -  u3)]  for  Uj,  u2>  u3 e  [0.  1] . 


To  construct  our  estimator,  we  first  estimate  the  S^(t)'s  by  the 


Kaplan-Meier  estimates 


(3) 


S.(t)  « 


n 


i:  Z  St 

(i)  j 


m  -  i 


m  -  i  +  1 


6(i)j 


for  t  2  0 . 


(Z,..  .<...<  Z,  .  .  denote  the  ordered  values  of  Z,.,  . . . ,  Z  . ,  and  6 . . .  . , 
(1)3  (n)j  13  "I  (l)j 


...,  are  the  6's  corresponding  to  ....  Z^j,  respectively) 


and  then  substitute  S^(t)  for  S ^  (t)  in  (2).  The  estimator  is  defined  by 


(4) 


ss(t)H 


"‘V1’ . Snlt”  ift<T(m) 


if  t  2  T 


(«) 


Here,  T,  .  =  max  T. 


(m) 


We  have  shown  that  the  estimate  Fg(=  1  -  Sg)  is  the  nonparametric  mle 
of  F  .  We  do  not  present  a  formal  proof  here,  but  instead  offer  the 


4 


following  heuristic  argument.  It  is  well  known  (Kaplan  and  Meier,  1958; 
Johansen,  1978)  that  the  Kaplan-Meier  estimate  is  the  nonparametric  mle  of 
a  distribution  function  when  the  data  is  randomly  right  censored.  An 
extension  of  this  result  is  that  (S^,  ...,  S  )  is  the  nonparametric  mle  of 
(S^,  ....  Sn);  the  invariance  principle  for  mle's  implies  that 

h(Sj...,  S  )  is  the  nonparametric  mle  of  h(Sj . S^) . 

Equation  (4)  provides  the  basis  for  the  estimation  of  functionals  of 
Fs  such  as  quantiles  and  the  mean.  For  example,  we  can  estimate  F  *(p)  by 

~  -  1  roo  ~ 

F^  (p) ,  for  p  e  (0,  1)  .  Similarly  we  can  estimate  ug  =  J^tdE^t)  by  us  = 


f0tdFs(t) 


2 .  ASYMPTOTIC  RESULTS 

In  this  section  we  describe  the  limiting  behavior  of  the  estimates  of 
the  system  and  component  reliability  functions.  As  an  application,  we  show 
how  our  results  can  be  used  to  improve  the  system  reliability.  The  proofs 
of  the  theorems  stated  below  will  appear  elsewhere. 

We  begin  with  an  important  result  concerning  the  simultaneous  estima¬ 
tion  of  the  component  reliability  functions.  For  T>0,  D[0,  T]  denotes  the 
space  of  real  valued  functions  on  [0,  T]  that  are  right  continuous  and  have 
left  limits,  with  the  Skorohod  metric  topology  (see  Chapter  3  of  Billings¬ 
ley,  1968),  and  Dn[0,  T]  denotes  the  product  metric  space. 

THEOREM  A.  Suppose  F^,  F^  are  continuous,  and  let  T  be  such  that 

F  j  (T)  <  1  for  j  =  1 ,  . . . ,  n .  Then  as  m  » 

in4  (S.  -  S, ,  S.-S.,  ....  S  -  S  )  -*•  (W  ,  W  ,  ....  W) 

1122  nn  12  n 

in  Dn[0,  T) ,  where  W^,  ....  are  independent  mean  0  Gaussian  processes, 
with  covariance  structure  given  by 


(5) 


Cov(W  (tj),  w.(t2))  = 


Sj(tl)  Sj(t2) 


t  dF  •  (u) 
M 

0 


Hj  (u)S. (u) 


for  0  <  t1  <  t2  S  T. 


The  weak  convergence  of  the  Kaplan-Meier  estimator  to  a  Gaussian  pro¬ 
cess  has  been  well-established  in  the  literature  (Breslow  and  Crowley,  1974; 
Gill,  1983)  under  the  assumption  that  the  lifelengths  and  the  censoring 
variables  are  independent.  In  our  situation  the  component  lifelengths  are 
censored  by  the  system  lifelength,  and  the  independence  condition  is  clearly 
violated.  We  can,  however,  redefine  the  censoring  variables  to  bypass  this 
difficulty.  This  is  easiest  to  explain  in  terms  of  the  example  given  by 
Figure  1.  Consider  Component  1.  Clearly,  X  is  censored  by  Y  =X0vX_, 
which  is  independent  of  X ^ .  Similarly,  X2  is  censored  by  Y2  =  Xj,  and  X,  by 

Y _  =  X  The  construction  is  general:  for  an  arbitrary  system,  X.  is  cen- 

^  i  J 

sored  by  Y^  =  lifelength  of  system  if  X ^  is  replaced  by  ®.  One  can  check 
that 


(i)  X.  and  Y.  are  independent, 

3  J 

( i i )  min(X.,  T)=min(X.,  Y.). 

3  J  y 

Thus,  the  known  weak  convergence  results  for  the  Kaplan-Meier  estimate 
apply  to  the  individual  S^'s. 

For  fixed  m,  the  S^’s  are  in  general  dependent.  This  is  easily  seen 
in  the  example  given  by  Figure  1,  in  which  Components  2  and  3  are  both  cen¬ 
sored  by  Component  1.  For  complicated  systems  the  dependence  may  be  com¬ 
plex.  Thus,  the  novel  results  given  by  Theorem  A  are  first,  the  joint 
asymptotic  convergence  of  the  S^'s  and  second,  thei-r  asymptotic  independ¬ 
ence. 

The  asymptotic  independence  of  the  S^’s  is  interesting.  Before  prov¬ 
ing  Theorem  A  we  conjectured  this  result  by  considering  the  two  special 
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cases  of  parallel  and  series  systems.  These  are  often  viewed  as  extreme 
cases  in  reliability  theory,  and  an  analysis  of  these  cases  may  shed  light 
on  the  dependence  structure  between  the  S^'s.  For  parallel  systems  there 
is  no  censoring  at  all;  the  F^’s  are  the  usual  empirical  cdf's  and  are 
trivially  independent  for  every  m.  For  a  series  system  (say  of  just  two 
components)  we  have 


(6)  m  CovfFjCt).  F2(t))  =m  EF^tjF^t)  -m  EF^tjEF^t)  . 

Since  Fj(t)F2(t)  = F®mP(t)  (see  (1)),  the  first  term  on  the  right  side  of 

(6)  is  m  Fs(t).  Consider  now  the  second  term  on  the  right  side  of  (6). 

From  Efron  (1967)  we  obtain  the  bounds 

(7)  0  s  Fj(t)  -  EFj(t)  s  Sj(t)e'mHj(t')  for  j»  1,2. 

(Actually,  Efron,  1967,  has  the  inequalities  reversed.  This  is  because  he 
uses  the  version  of  the  Kaplan-Meier  estimate  that  is  defined  to  be  0  past 
the  last  observation,  whether  or  not  it  is  censored.)  Combining  (6),  (7) 
and  the  fact  that  F ^ (t) F^ (t )  * F  (t) ,  we  obtain  for  series  systems  that 

(8)  m  Cov(Fj(t),  F  (t))-»-0  exponentially  fast  as  m 

This  proves  that  S^(t)  and  S^ft)  are  asymptotically  uncorrelated  and 
hence  asymptotically  independent,  assuming  that  joint  asymptotic  normality 
has  been  established.  Since  intuitively  the  series  structures  give  rise  to 


maximum  possible  dependence  among  the  S^’s,  we  were  led  to  conjecture  the 
asymptotic  independence  of  the  S^'s  for  general  structures.  Our  proof  of 
Theorem  A  does  indeed  give  the  result  (8)  for  arbitrary  systems. 


To  prove  Theorem  A  we  show  that  for  each  m,  the  vector  of  processes 


is  approximately  (as  m-*®)  a  martingale  with  respect  to  the  o-field  gener¬ 
ated  by  all  uncensored  component  deaths  observable  bv  time  t.  We  then 
apply  an  appropriate  martingale  central  limit  theorem*  via  the  Cramer-Wold 
device,  to  deduce  the  result. 

The  next  theorem  gives  the  asymptotic  normality  of  our  estimator  of 
system  reliability. 


THEOREM  B.  Suppose  F^,  F^,  .  ,  F^  are  continuous,  and  suppose  T  is  such 

that  F  .  (T)  <  1  for  j  =  1 ,  2,  ....  n.  Then  as  m  -*•  ® 

m^(S  -  S  ) -*•  W  weakly  in  D[0,  T] , 


where  W  is  a  mean  0  Gaussian  process  with  covariance  structure  given  by 


Cov (W(t  1 ) ,  IV ( 1 2 ) ) 


r  ■ 


...  un) 


(Uj,  ....  uj- 
(Sjftj) . snct1n 
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Since  this  estimate  is  well  known  to  be  consistent,  it  is  clear  that  one 
can  consistently  estimate  the  asymptotic  variance  of  Sg(t)  given  by  Theorem 
B.  This  enables  the  construction  of  asymptotic  confidence  intervals  for 
Ss(t). 

We  close  this  section  with  an  applicatin  of  Theorem  A  to  the  joint 
estimation  of  the  reliability  importance  of  components.  The  reliability 
importance  I  (t)  of  component  j  at  time  t  is  defined  by 

(u  ,  ....  u  )  = 

1  n  . 

(SjCt),  ....  sn(t)) 


I.(t)  »r2-  h(V 

i  3u . 

J  J 


V 


Let  e,,  ....  e  be  small  numbers.  Note  that 
1  n 


h(S 


11 

jfO+ej,  ..  ,  Sn(t),cn)-h(Sl(t) . Sn(t))  =  .  I  iej  1  j  ft)  - 


Thus,  the  reliability  importance  of  components  may  be  used  to  evaluate  the 
effect  of  an  improvement  in  component  reliability  on  system  reliability, 
and  can  therefore  be  very  useful  in  system  anal) sis  in  determining  those 
components  on  which  additional  research  can  be  most  profitably  expended. 

For  details,  see  pp.  26-28  of  Barlow  and  Proschan  (1975),  and  the  review  by 
Natvig  ( 1984) . 

Notice  that 


I.(t)  -h.fSjCt) . Sn(t))* 

where  h ^ :  [0,  l]n -*■[(),  1]  is  some  smooth  function.  Thus, 
Ij(t),  a  natural  choice  is 


to  estimate 


r.(t)  =h.(s1(t),  ...,  sn(t)). 
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THEOREM  C.  Suppose  F^,  are  continuous  and  T  > 0  is  such  that 

max  F  .  (T)  <  1 .  Then  as  hi  +  » 
l<j<n  J 

mYl  -  I  ,  ....  I  -  I  )  -»•  (Y  ,  ....  Y  ) 

11  n  n  1  n 

weakly  on  Dn[0,  T]  ,  where  (Y^,  ....  Y  )  is  a  vector  of  mean  0  Gaussian 
processes  whose  covariance  structure  is  given  by 


for  0  <  t1  s  t2  <T  and  jj,  j2  =  1,  .  .  .  ,  n. 

\c  ~ 

For  fixed  t,  the  asymptotic  normality  of  m  (I^(t)  -  I  ^  f  t ) . 

I  (t)  -  In(t))  follows  from  Theorem  A  and  an  application  of  the  delta 
method.  The  theorem  follows  from  an  easy  extension  of  this  argument  to  the 
process  {m'5 (I  j  (t)  -  I  j  (t) ,  .  .  .  ,  I „  (t)  -  1r  (t) )  ;  t  e  [0,  T] }  . 


5.  BOOTSTRAPPING  SCHEMES 

1 

Although  the  estimates  Fs(t),  Fg  (p) ,  and  are  easy  to  describe  and 
their  asymptotic  distributions  relatively  simple,  their  finite  sample  dis¬ 
tributions  (particularly  for  F  *(p))  are  completely  intractable.  In  prac¬ 
tice,  we  will  need  to  supplement  any  estimate  with  an  estimate  of  its  stan¬ 
dard  error,  and  in  fact  an  estimate  of  its  whole  distribution.  In  this 
section  we  discuss  bootstrapping  schemes  for  estimating  the  distribution 


METHOD  2.  In  this  estimate  of  the  model  P,  component  j  has  lifelength  dis¬ 
tribution  F j ,  for  j  =  1,  n;  so  we  construct  artificial  systems  by 

resampling  component  lifelengths  from  the  Kaplan-Meier  estimates  . 

A  formal  description  of  the  algorithm  is 

(1)  Generate  X  j  ~  F  ^  for  j  =  1,  2,  ...»  n  independently.  This  gives  one 
artificial  system  which  we  denote  by  Sys**. 

(2)  Repeat  Step  1  independently  m  times  giving  one  sample  of  artificially 
constructed  systems,  denoted  Sys**,  ...»  Sys*m. 

(3)  Compute  n*  base^  on  the  m  systems  in  Step  2. 

(4)  Repeat  Steps  1,  2  and  3  independently  B  times,  obtaining  n**>  n*“,  ... 

D 

n*  .  Figure  2  schematically  describes  this  method. 


X*  ~  F 
11  1 


- {2) - 

- - 

•  •  •  —  — 

x;2^2 

X*  ~  F. 
ml  1 

- - 

- (T) - 

X*  ~  F 
13  3 


X*  ~F_ 
m3  .> 


Sys*1  has  lifelength  T*  ...  Sys*m  has  lifelength  T* 

Figure  2.  In  Method  2,  the  estimate  n*  is  based  on  the  data  Z?.  = 

min (X*  .  ,  Tt),  6^  =1  (X*  .  <Tt). 
ij  i'  ij  ij 


Note  that  our  version  of  S^  given  by  (3)  is  strictly  positive  for 

t  smaxZ.  if  maxZ..  corresponds  to  a  censored  observation.  In  this  case, 
i  1 J  i  U 

we  view  F^  as  giving  mass  to  the  point  thus  X^  is  equal  to  <»  with  posi¬ 
tive  probability. 

In  applying  either  method,  the  n*^'s  are  used  in  the  usual  way  to  make 
inference  about  G.  Let  G*  equal  the  empirical  cdf  of  the  n*^'s.  The  stan¬ 
dard  deviation  of  G  is  estimated  by 
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9)  o-tr-^-T  I  (n*b- n**)2)4 

B*  1  b  =  1 

j  B 

where  n**  =  t  T  n*b.  The  standard  deviation  is  not  a  particularly  mean- 
B  b :  i 

ingful  quantity  if  G  is  asymmetric  or  far  from  normal,  which  may  occur  when 
the  sample  size  is  small.  In  that  case  the  intervals  n!  z  a,  where  z^  is 
the  100  •  a  percentile  point  of  a  standard  normal  variate  and  o  is  given  by 

(9)  are  essentially  useless.  As  alternatives,  we  can  use  the  percentile 
intervals  of  level  a  (a  e  (0,  1))  given  by 

(10)  [G;-l(o),  G*-l(l -a)], 

and  the  more  elaborate  bias  corrected  and  BC  intervals  (see  Efron  1984a, 

a 

b)  ,  all  based  on  G* . 

J  m 

In  comparing  the  two  methods  of  bootstrapping,  it  is  helpful  to  make 
an  analogy  with  the  regression  model  in  which  we  observe  random  pairs 
( Y ^ ,  X.),  i  =  l,  ...,  m,  where 


Y .  =  X. B  +  e.  , 

ill 


X.  is  a  p  dimensional  vector  of  covariates,  8  is  a  p  dimensional  vector  of 
unknown  coefficients,  to  be  estimated,  and  are  iid  from  an  unknown  dis¬ 
tribution  F  on  R1,  centered  at  0  in  some  sense.  Let  6  be  an  estimate  of  6, 
whose  variability  we  wish  to  assess. 

One  way  of  bootstrapping  is  to  resample  the  pairs  (Y^,  X^) ,  and  con¬ 
struct  an  artificial  value  6*  based  on  the  resampled  pairs.  This  corre¬ 
sponds  to  our  Method  1 . 

Another  way  to  bootstrap  is  to  resample  the  residuals 


e.  =  Y.  -  X.B. 

ill 
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Let  F  be  the  empirical  cdf  of  e,,  ....  t  ,  and  let  et»  •••»  e*  be  iid~F  . 

m  1  m  1mm 

We  construct  Y?=X(g  +  e?,  i  =  l,  ....  m,  from  which  we  can  obtain  an  artifi- 
ii  l 

cial  value  g*.  This  corresponds  to  our  Method  2. 

Method  2  makes  more  use  of  the  structure  in  our  model,  in  particular 
the  assumption  of  independence  of  the  component  lifelengths.  We  view  this 
both  as  a  strength  and  a  weakness. 

We  expect  Method  2  to  be  "preferable",  but  we  have  not  carried  out  any 
studies  to  assess  the  two  methods.  We  hope  to  pursue  this  problem  and 
report  the  results  in  a  future  paper. 

Efron  (1981)  has  discussed  use  of  the  bootstrap  on  censored  data.  He 
considered  the  standard  setup  for  randomly  right  censored  data,  which  cor¬ 
responds  to  two  components  arranged  in  series  in  our  model.  He  showed  that 
for  this  case,  the  two  methods  of  bootstrapping  are  identical .  The  two 
methods  are  not  always  the  same  in  our  situation,  as  can  easily  be  seen  by 
considering  a  parallel  structure  of  two  components. 

It  would  be  of  interest  to  study  the  asymptotic  behavior  of  the  boot¬ 
strap  in  our  problem.  This  would  determine  whether  confidence  intervals 
based  on  the  bootstrap  are  asymptotically  valid.  To  carry  out  such  a 
study,  it  would  be  necessary  to  select  some  relatively  simple  estimates; 
Fs(t)  where  t  is  fixed  is  a  prime  choice.  Let  F*(t)  denote  a  bootstrap 
replication  of  Fs(t),  obtained  by  one  of  the  two  methods.  Suppose  we  can 
show  that  for  almost  every  sample  sequence  {(Z.^,  6  —  )!  j  =  1,  •••,  n, 
i  =  1 ,  2 ,  3,  ...}, 


(12) 


mJs(F*(t)  -  Fs(t))  and  «Vs(t)  -  F(t)) 
have  the  same  asymptotic  distribution. 


It  would  then  follow  that  the  simple  percentile  intervals  (10)  are  asymp- 


totically  correct  to  the  first  order,  in  the  terminology  of  Efron  (1984b). 

Hjort  (1985)  has  shown  that  in  the  standard  setup  for  randomly  rieht 
censored  data,  the  bootstrap  approximation  is  asymptotically  correct  to  the 
first  order.  More  specifically,  suppose  that  are  iid~F,  are  iid~G, 
independently  of  the  X's,  and  that  the  data  is  (X^aY^,  I(X^<Y^)), 
i  =  1,  ...,  m.  Let  F  be  the  Kaplan-Meier  estimator,  and  let  F*  be  the 
empirical  cdf  of  a  random  sample  of  size  m  from  F.  Hjort  (1985)  showed  that 
under  certain  regularity  conditions,  for  fixed  t,  with  probability  one. 


(13) 


m^(F* (t)  -  F(t) )  and  m^(F(t)  -  F(t)) 
have  the  same  asymptotic  distribution. 


This  result  offers  hope  that  (12)  is  true  under  some  reasonable  set  of 
assumptions. 
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wise  a  censoring  time  is  recorded.  We  introduce  a  method  for  finding  estimates 
of  F(t)  based  on  the  mutual  censorship  of  the  component  lifelengths  inherent 
in  this  model.  We  present  limit  theorems  that  enable  the  construction  of 
confidence  intervals  in  large  samples.  For  small  samples,  we  describe  and  dis¬ 
cuss  bootstrap  schemes  that  can  be  used  to  implement  the  method. 
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